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Abstract 

In this paper, the interplay between a class of nonlinear estimators and strategic sensors is studied in several participatory- 
sensing scenarios. It is shown that for the class of estimators, if the strategic sensors have access to noiseless measurements of 
the to-be-estimated-variable, truth-telling is an equilibrium of the game that models the interplay between the sensors and the 
estimator. Furthermore, performance of the proposed estimators is examined in the case that the strategic sensors form coalitions 
and in the presence of noise. 


I. Introduction 

With a yearly expansion rate of 30% for mobile broadband subscriptions and smartphones accounting for 65-70% of all 
the sold mobile phones jTj, we are truly living in a connected world. This constant state of connectedness has enabled new 
technologies, such as participatory- and/or crowd-sensing applications, in which consented participants, with their smartphones, 
act as sensing units to estimate a variable^ Examples of commercial products using participatory-sensing schemes are Waze 
(for traffic estimation), Sensorly (for mobile coverage), Amazon review (for quality of service or product), or Mobile Water 
Management (for user data collection in control of irrigation canals) These systems, so far, have mainly relied on the 

benevolence of participants. However, due to various reasons, participants might provide false data. For instance, smartphones 
can be hacked or fake hots can be created by a hacker to feed false measurements to a sensing scheme |[4j. Another reason 
could be that the individual participants might want to change the outcome of participatory-sensing schemes for their benefit. 
For instance, in crowd-sensing applications for traffic estimation, such as Waze, people inevitably realize that their reports 
change the traffic estimates which, in turn, diverts vehicles to and from their neighbourhoods |5 ]. Finally, it could also be that 
the participants wish to retain their privacy. For instance, people might provide inaccurate and misleading reports to a polling 
organization (one of the oldest forms of crowd-sensing applications) to avoid revealing private and/or sensitive information 
to governments or for-profit organizations. Therefore, we are interested in designing robust participatory-sensing schemes that 
can work reliably in the face of strategic false-data injection. 

An earlier study in estimation with strategic sensors showed that, counter-intuitively, the performance of linear estimators 
degrades as the number of sensors increases |6j. Further, it was shown that when the sensors are herding, i.e., they are imitating 
each other’s policies, the quality of the estimation improves with the summoning of more sensors. Herding behaviour could be 
caused by the bounded rationality of the sensors. However, it can be also induced by designing an appropriate estimator that 
pushes each sensor to “behave the same as the rest”. Here, to utilize this observation, we design nonlinear estimators that can 
reject outlier reports and, hence, induce a herding behaviour among the sensors. A median estimator is an example of such an 
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] In this paper, we use the terms state, variable, and to-be-estimated-variable interchangeably based on the context. 
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estimator. The fact that median is robust in the presence of noisy and corrupted data is well known, e.g. see 0 - However, to 
the best of our knowledge, the benefits of using a median estimator in the presence of strategic sensors and in a game-theoretic 
framework has not been formalized in the literature. 

The main contributions of the paper are as follows. First, two linear estimators are considered and it is shown that sensors 
engaging in a truth-telling behaviour, i.e. reporting the correct variable that the participatory-sensing scheme wishes to estimate, 
does not correspond to an equilibrium of the game modelling the interaction between the sensors and the estimator. Next, it is 
demonstrated that for a class of nonlinear estimators truth-telling is in fact an equilibrium of this game. Third, the scenarios 
where strategic sensors might be colluding and form coalitions is studied. Particularly, the link between the size of the coalitions 
and the performance of the estimators in the aforementioned class is established. Fourth, the case where the measurements 
carried out by the sensors are noisy is studied. 

Many participatory-sensing mechanisms adopt quality assessment procedures and provide appropriate incentives to extract 
useful data. In this paper, however, we show that even if the participatory-sensing services do not adopt quality-assessment 
procedures or provide incentives, they can shape the behaviour of strategic sensors to elicit a truthful message. This is certainly 
a favourable approach as (i) there is no need for incentives (in either monetary or psychological forms) in the often large 
groups of recruited participants and (ii) it further reduces the complexity of the employed estimators. 

The rest of the paper is as follows. In Section [II| we investigate the interplay between the strategic sensors and the linear as 
well as nonlinear estimators. Section [Hl| extends these results to the case where the sensors can form coalitions. In Section [TV] 
we study nonlinear estimators when the sensors are noisy. Finally, we conclude the paper in Section [V] 


II. Noiseless Measurements 


Let us consider the case where a receiver is interested in measuring the variable x £ R. Hence, it employs n > 1 sensors 


to measure this variable and report it back. Each sensor transmits a measurement yi £ R. We assume that the transmissions 
occur simultaneously and, thus, the sensors do not have access to the messages communicated by each other. The receiver 
subsequently uses these messages to construct an estimate of x, denoted hv x € R. The receiver wants to guarantee that the 
estimation error E{||x — is small, where E{-} denotes the expectation of its argument. In this section, we assume that 
the sensors have access to the noiseless measurements of x. However, their interests are not aligned with that of the receiver 
and each other, that is, sensor i £ [?r] wishes to minimize the cost E{||(a;-(-0i) — £’|| 2 }, where Oi £ R. is its private information 
(i.e., it is not known by the other sensors and the receiver). Here, [n] denotes the set {1, ... ,n}. 

Assumption 2.1: X and ((4)i e [ ra j are jointly distributed Gaussian random variables with zero mean. 

We are interested in large groups of homogeneous sensors to mimic the behaviour of large crowds. Therefore, we make the 
following assumption. 

ASSUMPTION 2.2: (#i)ie[ra] are identically and independently distributed random variables. Moreover, x and 9i are inde¬ 
pendent for all i £ [n]. 

Let gi(-\x,di) denote the conditional distribution that sensor % uses for generating its message yi (i.e., its policy). Therefore, 
for any Lebesgue-measurable set 34 C R, we get 
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We use the notation Qi to denote the set of all such conditional distributions for each sensor i. Moreover, since the dimension 
of the messages for all the sensors is the same, Q 1 = • • • = Q„ = Q. Let us define a cost for sensor i £ [n] as 


Vi{(gi)ie h;tt) = E{||(a: + ^*) -7r((y*)ie[n])||2} 

/ OO pOO 

■ ||(a: + 6i) — 7r((2/i)i£[ n] )||2 f[ \9j(yj\ x ^j) d Vj] II \Pe(^j) de j]Px(x)dx, 

■°° je[n] jew 

where (ffi)»e[ n ] £ Q n is the policy of all the sensors, tv : R” —>• R is the policy of the receiver, i.e., x = 7r((2/*)»e[n])> and p x 

and pg are probability density functions. Note that this is an ex ante cost function as the sensors do not wait until they receive 

their private information and the state measurement in forming the cost function (it leads to a setup in which the sensors select 

their policies before entering the game). Hence, at the equilibrium induced by this cost function, the parameters of the policy 

are not a function of the private information and the state measurement, however, the transmitted message can be a function 

of these measurement^] Alternatively, we can form an ex post cost function 

j n ) = EdlOr + tfi) -7r((y i ) ie [ n ])|| 2 |0i ) a;} 


"3 J* 


= / ■■•/ ||(ar + 6» i )-7r((2/ i )i e [ n ])|| 2 [gjiy^x^^dy^ \pe{0j)dOj 

J ~ 00 ielnj i£W\W 

Definition 1 (7t-Stackelberg Equilibrium): A tuple of conditional distributions ( g *) ig [ n ] £ Q n constitutes an ex ante 
7r-Stackelberg equilibrium if 


9j e argmin Vj(g h (s , *)ieH\D}; 7r )> e N- 

The tuple constitutes an ex post 7r-Stackelberg equilibrium if 

g* £ arg min Uj (gj , (g*)ieln}\{jy, *), Vj £ [n], 

9j eS 

In statements where we do not distinguish between ex ante and ex post equilibria, the statement holds in both senses. 

Note that tv, in 7r-Stackelberg equilibrium, is a generic place-holder for an arbitrary policy n and indicates that the equilibrium 
corresponds to this policy. 

In this paper, our interest is to find an estimator n that can extract useful information from strategic sensors. This is motivated 
by the observation that, for linear estimators, the quality of the estimation degrades as the number of participating sensors 
increases Let us start with a simple, yet widely used, linear estimator to illustrate the problem. 

Definition 2 (Averaging Estimator): The receiver employs the estimator 


x = ip ((Vi) ielnj) == ( 2/1 4-h y n )/n. 

A tuple of conditional distributions ((g* )ieM ) £ G n constitutes an ex ante (ex post ) equilibrium for the averaging estimator 
if it is an ex ante (ex post ) L-Stackclberg equilibrium. 

Definition 3 (Truth-Telling Portfolio): Sensor i follows the truth-telling strategy i0c/i(t/i|iE, 9i) = 6(yi—x), where 
6 is the Dirac delta distributiorj^] The truth-telling portfolio is a tuple of conditional distributions such that all the sensors are 
employing the truth-telling strategy. 

Now, we can prove the following negative result regarding the averaging estimator. 

2 Using an ex ante optimal policy can be motivated by the lack of enough computational resources for online calculation of the policy based on the realization 
of the private information and the state. 

3 This is the same as saying P { y r = x} = 1. 

4 The Dirac delta distribution is a mapping 5 : R —» 1 U {Too} such that S(t) = 0 for all t £ R \ {0} and S(t.)dt = 1. 
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Theorem 4: The truth-telling portfolio is not an equilibrium for the averaging estimator. 

Proof: Let all the players except player i £ [n] employ the truth-telling strategy. Therefore, yj = x for all j £ [n] \ {*}. 
Hence, we have x = (1 — l/n)x + (l/n)yi. Now, sensor i using the policy y t = x + nQi results in E{\\(x + Of — x\\2\9i, x} = 0 
and E{||(cc + 9i) — x|| 2 } = 0, which is strictly less than, respectively, substitution of the truth-telling strategy in both ex post 
and ex ante cost functions. ■ 

Even with the optimal linear estimator E { x | y 1 ,.... y n }, it was observed in j6] that the truth-telling is not an equilibrium 
of the game and that, at the equilibrium, the quality of the estimation degrades as the number of sensors increases. In limit, 
no information can be recovered from the transmitted messages. This observation, together with Theorem [4] motivates us to 
find estimators for which the truth-telling portfolio is an equilibrium. We address this concern in the remainder of this section. 
Definition 5 (2£-Rejection Averaging Estimator): Assume that n> 2£+l. Let (tj)je[n] be given such that Vh < 


Ui 2 < ‘' ’ < Ui n ■ The receiver employs the estimator 


x = i/>e((yi)i e[r»]) 


1 

n- 2£ 


n—i 


Y v*y 

j=e +1 


A tuple of conditional distributions ((<?*)ie[n]) £ G n constitutes an ex ante (ex post) equilibrium for the 2f-rejection averaging 
estimator if it is an ex ante (ex post) ^-Stackelberg equilibrium. 

Remark 2.1: In the statistics literature (e.g., j8, p. 16]), 2/-rejection averaging estimators are alternatively known as trimmed 
averaging estimators (since they are derived from an averaging estimator by excluding the extreme values). Here, we use the 
name 2f-rejection averaging estimator because of our desire to work with integer values of £ rather than percentages of rejection 
i/n. 

Theorem 6: The truth-telling is an equilibrium for the 2/'-rejection averaging estimator for all t £ IL(n-l)AIJ. 

Proof: Let all the players except player i £ [n] employ the truth-telling strategy. Therefore, yj = x,\/j £ [n] \ {;«}. 
Hence, we have x = x irrespective of y, (as it will be rejected). Thus, truth-telling (among all the other policies) minimizes 
the cost of sensor i in both ex ante and ex post senses. ■ 

Definition 7 (Median Estimator): Let (*j)je[[n] be given such that y n < yt 2 < • • • < y ln . The receiver employs the 
estimator 




(yi„,2 + yi n ,2+i)/ 2 


n £ E, 


[ !/»(„+i V 2. n£ O, 

where E and O represent the sets of even and odd integers, respectively. A tuple of conditional distributions ((ff*)ig[ n ]) £ G n 
constitutes an ex ante (ex post) equilibrium for the median estimator if it is an ex ante (ex post) </>-Stackelberg equilibrium. 
Lemma 8: The median estimator is equivalent to the 2£-rejection averaging estimator if £ = (n - 2)/2 for n £ E and 

£ = (n — l)/2 for n £ O. 

Proof: The proof follows from simple algebraic manipulations and is hence omitted. ■ 

Corollary 9: The truth-telling is an equilibrium for the median estimator. 

So far, we have assumed that the sensors do not form coalitions to deceive the receiver. In the next section, we define a 
different game in which sensors can act together. 


III. Extension to Coalitions 

Assume that sensor i £ [n] can submit Cj £ N messages instead of one. This setup has two interpretations. First, each sensor 
represents a coalition of size c* instead of a single sensor. Alternatively, we can assume that each sensor represents an array of 
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sensors introduced by a single hacker. Therefore, the receiver has access to (?/i)ie[c] where c = X^e[n] C P The rece i ver does 
not know (Cj)je[[n] • Here, the definition of the 2/'-rejection averaging estimator is the same as in Definition |5j with c denoting 
the number of messages instead of n. 

Examples of participatory-sensing services that admit coalitions are legislative bodies, e.g. the U.S. congress, and truth¬ 
finding committees, e.g. royal commissions in most of the Commonwealth countries. Here, the coalitions are political parties 
because their members most often, persuaded by the party whip, vote on party lines. Therefore, it would be nice to construct 
estimators that can recover the truth despite the ever-growing partisanship. 

THEOREM 10: Let X^je[Ti]\{i} c o > c* + 1 for all i £ [n]. The truth-telling is an equilibrium for the 2f-rejection averaging 
estimator if max l£(n] a < £ < L(Eje[„] c j ~ l )/ 2 \- 

Proof: The proof is similar to the proof of Theorem [6] and is hence omitted. ■ 


Remark 3.1: Theorem 10 shows that, for a given t, the estimator is robust to admitting a coalition of sensors with the 
size of, at most, £ assuming that no coalition has more sensors than the sum of the size of all other coalitions minus one (i.e., 
there is a balance of power between the competing coalitions). 

Corollary 11: Let 


mm 

*£[«] 


E 


> 


1 > 

EeW c ^ E - 


yfeM]\{»} 

The truth-telling portfolio is an equilibrium for the median estimator. 

Remark 3.2: Corollary [TT] shows that the truth-telling portfolio is an equilibrium if no individual coalition has the majority. 
Thus, the median estimator is extremely robust to manipulation by strategic entities even if they cooperate. This observation 
has interesting implications in politics, that is, any truth finding committee, as a whole, can only recover the tmth so long as 
no single party has the majority because, in such case, they can silence the voice of the others. 


IV. Extension to Noisy Measurements 

Consider the case where sensor i £ [n] has access to noisy measurements of the state denoted by z- t = x + Wi, where 
(wf) ig[ n ] are independent zero-mean Gaussian random variables. Similarly, sensor i uses the conditional distribution gi(-\zi, Of) 
to generate its message yi. In this case, we say that sensor i follows the truth-telling strategy if gi(yi\zi,9i) = S(y.i — zf). 
Unfortunately, access to noisy measurements destroys the truth-telling property of the median estimator. 

Theorem 12: The truth-telling portfolio is not an equilibrium for the 2£-rejection estimator in the presence of noise. 

Proof: Let us pick a sensor k £ [n]. Set yj = Zj for all j £ [n] \ {fc}. Assume that y *. = (n — 2£)9k + Zfc. Let 
be given such that ij k for all 1 < j < n — 1 and < yi 2 < • • • < t/j n _ 1 . Note that k £ {£ + 1,n — £} 
with a positive probability. In that case, we have x = l/(n — 2£) Ej=r+i Vi, - This results in a cost equal to E{||(a: + 9k) — 
xh\zk,6 k ,yi, = E{||x - l/(n - 21) J2j f( +[ z i:i WfzkjOk-, y -\,... ,y n } which is strictly smaller than sensor k’s cost 

had it been truthful: E{||(x + 9k) — 1 /{n — 2£) Ej=r+i Zi j Ibl-fe, 9k, yi, ■ ■ ■, y n }- Hence, by taking expectation of these terms 
over yi,.... y rll we can show that the cost of sensor k can be reduced by not acting truthfully. This concludes the proof. ■ 

A similar result can be proved for the median estimator. 

Theorem 13: The truth-telling portfolio is not an equilibrium for the median estimator in the presence of noise. 
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Proof: Let n £ O as, with a similar idea, we can prove the result for n £ E. Pick a sensor k £ [n]. Let be given 

such that ij 7 ^ k for all 1 < j < n — 1 and y ix < y i2 < ■ ■ ■ < ^ , Hence, we have 


y*(n- 1)/2> 2 / fc ^ 2/ i (n-l)/2> 


x = 


2 /fe; 


yi(n~ i)/2 — yk — y^(n+i)/2‘‘ 


,2/i(™+i)/2’ y^ ^ 1)/2> 


This gives 


E{||(x + 6 > fc ) - x|| 2 |yi, • • ., 2 /n} = < 


E{||(tr + 6> fe ) — 2/i ( „_ 1)/2 1| 2 }, 2/fc < y* (n _ 1)/2 , 

e{||(x’ + Ok) — z/feII2}, t/i (n _ 1)/2 < yk < yi (n+1)/2 , 

,E{||(x + #*;) — t/i ( „ + i )/2 1| 2 }, yk > 2/i(„ +1)/ 2 - 

Selecting = E{x + 2 ;^} = Zfc + $& minimizes E{||(x + 0fc) — 3 /*;11 2 }- This results in a strictly smaller cost than using 

a truthful strategy since j/i (n _ 1)/2 < Zfe + 9k < y?: ( „ +1)/2 occurs with a positive probability. This concludes the proof. ■ 

Although truth-telling is no longer an equilibrium, we can characterize another equilibrium that can reveal some information 
about the to-be-estimated-variable. 

Remark 4.1: Notice that, without loss of generality, we can consider an odd number of measurements because we can 
always transform an even number of measurements into an odd number by either discarding a measurement randomly or by 
introducing a very large or a very small measurement (that always gets discarded). 

Theorem 14: Let n £ O. The tuple (y*)ie[ n ] £ G n defined as g*(iji\z l: Of) = 5(yi — (Zi +Qif), i £ [n], is an equilibrium 
for the median estimator in the presence of noise. 


Proof: Following the same line of reasoning as in the proof of Theorem 13 selecting = Zk + 9k is the best response 
of each sensor irrespective of the others in both ex ante and ex post senses. ■ 

Remark 4.2: It is interesting to note that reporting yk = Zk + 9k is a dominant strategy, i.e., it is in the benefit of players 
irrespective of other reports (when the players do not form collations). Therefore, even if some sensors are randomly-behaving 
or faulty, the rational ones report Zk + Ok. 


Remark 4.3: The captured equilibrium in Theorem 14 is not unique. This can be observed from the fact that the sensors 
can employ any stochastic or deterministic mappings for constructing their messages yk when it is very large or very small, 
since the message will be discarded regardless and has no impact on the outcome of the estimation. 

Proposition 15: The equilibrium in Theorem 14 results in lim m _ ) . 00 i „ = 2 m +i E{||x- 0 ((yi) ieW )|| 2 } = 0 . 

Proof: At the equilibrium, we have yk = Zk+9k = x+Wk+9k. Therefore, x = f{{x+Wi+9i) ie ^ n j) = x+f((wi+9i) ie ^ n j). 


From 


0- 


we know that linr m _ ) . 00>n= 2 m +i + 9i ),e[ n ]) = 0, a.s. This concludes the proof. ■ 

Example 1: Consider an estimation problem in which V xx = 1, Vg i g i = 1, and V WiWi =0.1 for all i where V xx , Vg t g t , and 
V WiWi are the variances of x, 0, , and w ,, respectively. When using the median estimator, at the equilibrium in Theorem 14 the 
sensors report yi — Zi+9i,\/i. The blue curve in Fig.[l]shows the estimation error E{||x — <^((j/i)ie[nj)|| 2 } approximated using 
10000 random samples. If, for the signals transmitted at this specific equilibrium, we were to use the averaging estimator, the 
estimation error would follow the red curve in Fig. [T| which is certainly smaller. This degradation in the performance is the 
price of robustness. One way to capitalize on this difference is to lie to the players that the utilized estimator is the median one 
but use an averaging policy and, hence, nudge them towards a good behaviour (from the perspective of the estimator) JTO). 
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Fig. 1. Estimation error variance for median (blue) and mean (red) estimators at the equilibrium captured in Theorem 1 1 4| 


Note that this is applicable if the players cannot infer the correct mechanism by experimenting, e.g., when the players interact 
with a given participatory-sensing scheme very infrequently. 0 

V. Conclusions 

In this paper, the problem of designing participatory-sensing mechanisms is considered. Particularly, it is shown that for a 
class of nonlinear estimators, truth telling is an equilibrium of game modelling the interaction between the sensors and the 
estimator. Later, it is established, for the case where the sensors collude and form coalitions that are no lager than half of the 
total participants, that there is always an estimator, specifically the median estimator, which results in truth telling being an 
equilibrium. Future research can focus on dynamic estimation problems. 
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